Corpus: bre_wikipedia_2007_10K


2.2.5 Most frequent word beginnings

The most frequent word beginnings, given as character n-grams for N = 1 to 5, together with a Zipf diagram of their frequencies.
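A minimal sketch of how tables of this kind could be computed, assuming a word beginning is simply the first N characters of a word and that every entry of the input word list is counted once. The tooling behind this page is not described here, so the function names `prefix_counts` and `top_prefixes` and the toy word list are hypothetical.

```python
from collections import Counter


def prefix_counts(words, max_n=5):
    """Count word-initial character n-grams for n = 1..max_n."""
    counts = {n: Counter() for n in range(1, max_n + 1)}
    for word in words:
        for n in range(1, max_n + 1):
            # Assumption: words shorter than n contribute no n-gram.
            if len(word) >= n:
                counts[n][word[:n]] += 1
    return counts


def top_prefixes(counts, n, k=5):
    """Return the k most frequent beginnings of length n
    as (rank, frequency, n-gram) rows, with a trailing '-' marker."""
    ranked = counts[n].most_common(k)
    return [(rank, freq, prefix + "-")
            for rank, (prefix, freq) in enumerate(ranked, start=1)]


# Hypothetical toy word list, not taken from the corpus.
words = ["disk", "dis", "div", "den", "hent", "ha"]
counts = prefix_counts(words)
for row in top_prefixes(counts, 1, k=2):
    print(*row)
```

Whether the page counts distinct word forms or running tokens is not stated; passing a token list instead of a word list to `prefix_counts` would give token-based frequencies.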


Figure: Zipf's diagram for word beginnings (Gnuplot diagram)
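A Zipf diagram plots frequency against rank on logarithmic axes, where a Zipf-like distribution shows up as a roughly straight line with negative slope. The sketch below builds the log-log points and fits a least-squares slope. The helper names `zipf_points` and `fit_slope` are hypothetical, and the sample input is only the five single-character frequencies listed under Top Characters in this section, far too few points for a meaningful fit.

```python
import math


def zipf_points(frequencies):
    """Return (log10 rank, log10 frequency) pairs, most frequent first."""
    ordered = sorted(frequencies, reverse=True)
    return [(math.log10(rank), math.log10(freq))
            for rank, freq in enumerate(ordered, start=1)]


def fit_slope(points):
    """Least-squares slope of y on x for a list of (x, y) points."""
    if len(points) < 2:
        raise ValueError("need at least two points to fit a slope")
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return sxy / sxx


# Frequencies of the five most frequent single-character beginnings.
top_characters = [1603, 1352, 1237, 1213, 1173]
slope = fit_slope(zipf_points(top_characters))
print(f"log-log slope over the top 5 ranks: {slope:.3f}")
```

A full diagram would use the complete frequency list for each N rather than only the top five ranks.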

Top Characters
Rank  Frequency  N-gram
1     1603       d-
2     1352       h-
3     1237       g-
4     1213       s-
5     1173       k-

Top Character Bigrams
Rank  Frequency  N-gram
1     654        di-
2     365        he-
3     315        ke-
4     295        ha-
5     282        ho-

Top Character Trigrams
Rank  Frequency  N-gram
1     187        c’h-
2     176        dis-
3     125        gou-
4     107        tre-
5     97         ken-

Top Character 4-Grams
Rank  Frequency  N-gram
1     59         c’ho-
2     54         penn-
3     51         c’he-
4     47         disk-
5     45         c’ha-

Top Character 5-Grams
Rank  Frequency  N-gram
1     34         treuz-
2     27         Sant--
3     27         niver-
4     24         heñve-
5     22         labou-
459 msec needed at 2017-11-29 21:38